Set Covering-based Feature Selection of Large-scale Omics Data
نویسندگان
چکیده
منابع مشابه
A Lagrangian-based heuristic for large-scale set covering problems
We present a new Lagrangian-based heuristic for solving large-scale set-covering problems arising from crew-scheduling at the Italian Railways (Ferrovie dello Stato). Our heuristic obtained impressive results when compared to state of the art codes on a test-bed provided by the company, which includes instances with sizes ranging from 50,000 variables and 500 constraints to 1,000,000 variables ...
متن کاملParallel Large-Scale Feature Selection
The set of features used by a learning algorithm can have a dramatic impact on the performance of that algorithm. Including extraneous features can make the learning problem harder by adding useless, noisy dimensions that lead to over-fitting and increased computational complexity. Conversely, leaving out useful features can deprive the model of important signals. The problem of feature selecti...
متن کاملVisual Learning by Set Covering Machine with Efficient Feature Selection
In this paper, we propose a new visual learning method for real-world object recognition task. Our method is based on the Set Covering Machine (SCM), to make the learning time shorter than the methods based on commonly used trial-and-error algorithms, such as genetic programming and reinforcement learning. Generally, the process of visual learning is quite time-consuming because image data cons...
متن کاملFeature Selection Based on Run Covering
This paper proposes a new feature selection algorithm. First, the data at every attribute are sorted. The continuously distributed data with the same class labels are grouped into runs. The runs whose length is greater than a given threshold are selected as “valid” runs, which enclose the instances separable from the other classes. Second, we count how many runs cover every instance and check h...
متن کاملRegression-Based Feature Selection on Large Scale Human Activity Recognition
In this paper, we present an approach for regression-based feature selection in human activity recognition. Due to high dimensional features in human activity recognition, the model may have over-fitting and can’t learn parameters well. Moreover, the features are redundant or irrelevant. The goal is to select important discriminating features to recognize the human activities in videos. R-Squar...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the Korean Operations Research and Management Science Society
سال: 2014
ISSN: 1225-1119
DOI: 10.7737/jkorms.2014.39.4.075